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Abstract 

Non-orthogonal multiple access (NOMA) systems have the potential to deliver higher system 
throughput, compared to contemporary orthogonal multiple access techniques. For a linearly precoded 
multiple-input multiple-output (MISO) system, we study the downlink sum rate maximization problem, 
when the NOMA principles are applied. Being a non-convex and intractable optimization problem, 
we resort to approximate it with a minorization-maximization algorithm (MMA), which is a widely 
used tool in statistics. In each step of the MMA, we solve a second-order cone program, such that the 
feasibility set in each step contains that of the previous one, and is always guaranteed to be a subset of 
the feasibility set of the original problem. It should be noted that the algorithm takes a few iterations to 
converge. Furthermore, we study the conditions under which the achievable rates maximization can be 
further simplified to a low complexity design problem, and we compute the probability of occurrence of 
this event. Numerical examples are conducted to show a comparison of the proposed approach against 
conventional multiple access systems. NOMA is reported to provide better spectral and power efficiency 
with a polynomial time computational complexity. 
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I. Introduction 

Efficient multiple access techniques in wireless systems has long been a sought after desirable 
feature. Several facets haven been eonsidered, while dealing with the design of multiple access 
sehemes. For example, spectral efficiency, reliability and quality of serviee, effieient utilization of 
radio resourees, and reeently, energy effieieney are some of the objeetives, that form the basis of 
multiple access techniques in wireless eommunication systems. Non-orthogonal multiple aeeess 
(NOMA) has been conceived as a breakthrough technology for fifth generation (5G) wireless 
systems m-a. The main themes of 5G networks, namely, redueed lateney, high eonneetivity, 
and ultra-fast speeds are being attributed to devising systems working on the prineiples of NOMA 
m. NOMA uses power domain to multiplex additional users in the time/frequeney/code, slot 
already oeeupied by a mobile deviee. The enabling teehniques for NOMA are not new and 
find their roots in some old ^nnci^lts-superposition coding (SC) and successive interference 
cancellation (SIC). SC was first proposed by Cover in dH, as an aehieveability scheme for a 
degraded broadeast ehannel. Likewise, various versions of SIC have been employed in the past 
in systems like Vertical-Bell Laboratories Layered Spaee-Time (V-BLAST) and Code Division 
Multiple Aeeess (CDMA) [[ 51 , [ 0 . Therefore, in addition to being a candidate for the next 
generation of 5G wireless networks, it is very important that NOMA has also the potential to 
integrate well with existing multiple aeeess paradigms. 

In NOMA, the base station (BS) transmits a superposition eoded signal, whieh is a sum of all 
messages of the users. The users are arranged with respeet to their effeetive ehannel gains i.e., 
the one with the lowest gain is assumed to be at the bottom of the sequence, the one with the 
highest gain at the top, while the remaining are arranged in an inereasing order between the two. 
NOMA ensures that the weaker users reeeive a higher fraetion of the total power budget. When 
a stronger user is allowed to access the slot being occupied by a weaker one, its signal does not 
adversely impact the performance of the weaker user, as it is already experieneing a channel 
fade. At the same time, the stronger user ean get rid of the interferenee due to the weaker one, 
by applying a SIC operation. In traditional orthogonal multiple access schemes, once the slot has 
been reserved for a user, other users are prohibited from aeeessing that. This, of eourse, has a 
negative impaet on the aggregate systems’s throughput. The major outeome of sharing the same 
ehannel slot is that the sum rates are expected to improve, and with intelligent power allocation 
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the weaker users ean also be efficiently served. 

A. Literature 

To the best of our knowledge, as of today, NOMA has mostly been explored for single-input 
single-output (SISO) systems. For example, in Q Ding et al. studied NOMA for the downlink 
of a cellular system, and by assuming fixed powers, they derived expressions for the aggregate 
ergodic sum rate and outage probability of a particular user. Interestingly, in that paper it was 
concluded that in the absence of a judiciously chosen target data rate, a user can always be in 
outage. For multiple-input multiple-output (MIMO) systems, Lan et al. [[81, explored the impact 
of error propagation of SIC and user velocity on the NOMA performance. Their results showed 
that even in the worst error propagation scenario, NOMA outperforms conventional orthogonal 
multiple access and can yield performance gains for different user mobility. Chen et al. [0, 
studied NOMA for the downlink of a wireless system, when BS and receivers are each equipped 
with two antennas. Traditional minimum-mean-squared-error (MMSE) precoding matrices have 
been used, which do not guarantee maximum throughput for a given user ordering. Similarly, 
Timotheou et al. m, studied the power allocation for NOMA in a SISO system from a fairness 
point of view. Finally, Ding et al., investigated MIMO-NOMA in ifTTl . and derived outage 
probabilities for fixed and more sophisticated power allocation schemes. 

B. Contributions 

In this paper, we focus on the downlink of a multiple-input single-output (MISO) system, 
in which the transmit signals of each user are multiplied by a complex precoding vector. 
The goal is to design these vectors in order to maximize the total throughput of the system, 
while simultaneously satisfying the NOMA constraints. To solve this problem we rely on the 
approximation technique that has been commonly dubbed as concave-convex procedure (CCP jl] or 
minorization-maximization algorithm (MMa]^ |[T^ - |[T71I . Under the different name of sequential 
convex programming a parametric approach has been proposed in |[T^ . Recently, in the context 
of weighted sum rate maximization and physical layer multicasting, similar ideas were used 

*If the original problem is a minimization instead of a maximization, the procedure has been referred to as convex-concave 
procedure (CCP). 

^The MMA has also been called as majorization-minimization algorithm if the original problem is a minimization problem. 
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by Hanif et al. and Tran et al. in [[T^ . Il20l . respectively. Due to the flexible nature of MMA 
approach, these ideas have also be used in image processing applications [l2n . 

The main contributions of this paper can be summarized as follows: 

. By incorporating decodability constraints to ensure that better users can perform SIC, 
we provide a novel mathematical programming based approach to solve the sum rate 
maximization problem in the downlink of a MISO system, relying on NOMA principles. 
Similarly, constraints are also included to guarantee that the desired signals of the weaker 
users are strong enough to render them non-zero data rates. 

• Using the MMA concept, we develop an iterative algorithm that solves the NOMA sum 
rate maximization problem and obtains complex precoding vectors, which maximize the 
aggregate throughput. Unlike traditional approaches that rely on semidefinite programming 
(SDP), to deal with such optimization problems, the MMA based algorithm solves a second- 
order cone program (SOCP) in each step. 

. We show that the proposed algorithm is provably convergent in few iterations. Moreover, a 
complexity analysis is also carried out to show that the worst case complexity of the SOCP, 
which we solve in each run, is just polynomial in design dimensions. Furthermore, under 
plausible assumptions, the algorithm converges to the Karush-Kuhn-Tucker (KKT) point of 
the original problem. 

• We present an approximation to the original optimization program, with the main goal 
of complexity reduction. To provide more insight, we study conditions under which this 
approximation is tight. Moreover, for the special case of orthogonal precoding vectors, we 
provide a probabilistic insight regarding the tightness of the proposed approximation. 

. Finally, numerical examples are presented to show the validity of the proposed algorithm. 
These results reveal that the NOMA transmission outperforms the conventional orthogonal 
multiple access schemes, particularly when the transmit signal-to-noise ratio (SNR) is low, 
and the number of users are greater than the number of BS antennas. We also investigate 
the scenario, where the proposed approximation exactly matches the original problem. In 
this case, it is shown that the distance between the users and the BS plays a crucial role 
and affects the system’s throughput. 
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C. Structure 

The rest of the paper is organized as follows. In Seetion [III we describe the system model 
and formulate the problem. In Section [Till we present the preliminaries, needed to outline the 
algorithm in the next section. The algorithm is developed and analysed in Section |IVl while a 
reduced complexity approximation is motivated and developed in Section |Vl Finally, numerical 
results and conclusions are presented in Sections |Vl] and IVIIl respectively. 

D. Notations 

Bold uppercase and lowercase letters are used to denote matrices and vectors, respectively. 
The symbols C"^, M" and M” are used for n-dimensional complex, real, and nonnegative real 
spaces, respectively. For a vector e, its j* coordinate is denoted by e*. Furthermore, ||e ||2 is used 
to represent I 2 norm of a vector e e €"■, which is defined as ||e ||2 = kjP’ where |ej| is 
the absolute value of e^. 0(.) is reserved for complexity estimates. Unless otherwise specified, 
calligraphic symbols are used to represent sets, [x] is the ceiling function, which returns the 
smallest integer not less than x. Ve denotes gradient of a vector e. min(.) gives the minimum 
of the quantities passed as its argument. 3?(c) and 9=(c) denote the real and imaginary parts 
of a complex number c, respectively. Ft{E) denotes the probability of event E. Any new or 
unconventional notation used in the paper is defined in the place where it occurs. 

II. System Setup 

We consider the downlink of a BS, equipped with T antennas and serving N single antenna 
users. NOMA principle is used for transmission purposes (please refer to Fig. [T]). We further 
assume that the transmitted signal of each user equipment (UE) is linearly weighted with a 
complex vector. Specifically, to all N users, the BS transmits a superposition of the individual 
messages, WjSj for all i, where Wj G and Sj are the complex weight vector and the transmitted 
symbol for UE-i, respectively. Therefore, under frequency flat channel conditions the received 
signal Hi at UE-f is 

( N \ N 

^ WjSj j + rii = ^ hf WjSj +ni, i = 1,..., (1) 

i=i / i=i 

where hj = \Jdpg,i G C^, with di being the distance between UE and the BS, 7 is the path 
loss exponent, g* ~ CAA(0,I), and n* represents circularly symmetric complex Gaussian noise 
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T 



Fig. 1. The system setup. A BS with T antennas serves N users. The user UE-fc receives interference from the users UE-fc+ 1 
to UE-A^. The signals of remaining users from UE-1 to UE-fc — 1 are cancelled at UE-fc. 


with variance Subsequently, NOMA proposes to employ SIC at individual UEs, based on 
the partieular ordering. For instanee, the works in [|3, iQ use the faet that for a single-input 
single-output (SISO) system, onee the ehannels are arranged in a partieular order (inereasing or 
deereasing), then a UE-/c deeodes all those UE-i signals, whose index i < k (inereasing order) 
and i > k (deereasing order). An illustration of this proeess is also given in Fig. [T] However, 
simple SISO ordering eannot be transformed to the MISO setup. The present work does not 
foeus on the optimal ordering problem, but in the design of the eomplex weighting veetors, w*, 
that maximize the aggregate throughput of the system, for a given UE ordering. Next, we assume 
that the ehannel state information (CSI) is perfeetly known at all nodes. 
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A. Problem Formulation 


We assume that the UE-1 is the weakest (and henee eannot deeode any interfering signals), 
while UE-A^ is the strongest user, and is able to nullify all other UE interferenee by performing 
SIC. The other UEs are placed in an increasing order with respect to their index numbers. Eor 
instance, UE-m is placed before UE-tt, if index m < n. Increasing channel strengths can be used 
to order the users. But, as mentioned above, this ordering may not be optimal, and better rates 
may be achievable for different order of users. According to NOMA the achievable rate after 
SIC operation at the user, with /c > z for alH = 1,..., /c — 1, is O, Q 


Rl = l0g2 1 + 


IhfwA 


2^j= 


1 < k < N -1. 


( 2 ) 




|hfwjp + 0-2 

An important observation should be noted here. Eor the above rate to be achievable at UE-/c, it 
is necessary for all UE-j, with j > k, to satisfy 

Ihfwfcp 


m = logs 1 + 


^-~\N 

A^ra=k 


Ihfw^P + 0-2 


^ Rth j — k + 1,N 


(3) 


jm=k+l 

where Rj is the rate of UE-j to decode the message of fc* UE, and Rth is some target data rate 
for user In addition, to allocate non-trivial data rates to the weaker users, which present a 
lower decoding capability in a given order, the following condition must also be satisfied 

IhfcWiI^ > ... |lifcWfc_i|^ > IhfcWfcl^ > |hfcWfc+i|^ ... > |hfcW7v|^. (4) 


As a further insight, (jS]) ensures that the signal-to-interference-plus-noise ratio (SINK) of UE- 
j to decode the message of UE-/c, where j > k, is higher compared to the SINK of UE-/c to 
decode its own message. Once this condition is satisfied, all users, which are assumed to be at a 
‘higher’ level in the given ordering, are able to perform SIC. Therefore, we propose to maximize 
the minimum of these ‘direct’ and ‘cross-user’ decoding SINRs. To further exemplify, consider 
a three user system with UE-1 the lowest and the UE-2 the highest in the ordering. Now, assume 
that SINRj > Tth and SINR^, < Tth, w = 2,3, where Tth is some threshold rate. In this scenario, 
both users 2 and 3 are unable to decode the message of UE-1 as the SINR} is at least as large as 
Tth, and therefore, SIC cannot be applied. Motivated by this, we aim at obtaining such precoders 
that ensure, we have Tth < SlNRl, w = 1,2,3. Moreover, the sequence of inequalities in ([4]) 
helps to boost up the desired signal level of the ‘lower’ level users, and in the absence of this 
guarantee it is likely that most, if not all, radio resources are allocated to the users that receive 




very low or no interference. The sum rate, Rsum, therefore, is given by 



k=l 


( 5 ) 


where 



i = 


( 6 ) 


Now, the optimization problem can be formulated as 


maximize i?. 


'Sum 


(7a) 


s. t. IhfcWiI^ > ... |hfcWfc_i|^ > IhfcWfcl^ > |hfcWfc+i|^ .. > IhfcWjvl^ l<k<N (7b) 


N 



(7c) 


i=l 


where the constraint in (TTcl) represents that the total power, which is upper bounded to Pth- It 
is important to mention here that in the original NOMA |[T1, |I3, its concept was applied only 
to two users, but it can be extended to a multiuser setting. Such an extension requires optimal 
grouping [l22l and hence it is left open for future investigation. 


III. Prerequisites 


In order to solve the optimization problem in ([7]), we will eventually present an iterative 
algorithm. However, first it is necessary to transform the original problem and then to apply 
approximations that render it tractability. 

A. Equivalent Transformations 

The problem in (|7]) is non-convex, and it seems that it is not possible to directly approximate 
it, since the only convex constraint is the power constraint. Therefore, several steps need to be 
invoked before we can present an algorithm, which solves this problem approximately. To this 
end, we first introduce the vector r E and we observe that ([7]) can be equivalently written 




as 


maximize 

WiGCr,Vi,reIR^ 


S. t. 



N 


rfc — 1 < min (SINR^,..., SINR^) , k 
rw-I < 


Ih^WTvp 




dVbl) & dVcl), 




(8a) 

(8b) 

(8c) 

(8d) 


where rj,i = are the components of r, and the objective has been obtained by 

eonsidering that log(-) is a non-deereasing funetion, and the geometrie mean of the veetor r, 


i.e., 


n^i f'kj , is eoneave and inereasinglfl. It is well known that the geometric mean can 
be readily expressible as a system of second-order cone (SOC) constraints ll2^ . So this step has 
no negative impact on the tractability of the objective function. However, the overall problem 
still remains intractable. Nonetheless, the original formulation is factored into several different 
constraints, and so, these factors can be processed individually. We first focus on the constraints 
in (l8bl) . and then move to the remaining intractable constraints. Without loss of generality, it 
holds that 


l/N 


Tk 


1 < min (SINR^,..., SINR^) ^ - 1 < 


SINR^ 


(9) 


min (SINR^+i,...,SINRy , 
for /c = 1,..., iV — 1. The constraint in (l8bl) has been purposely written as that in (|9l), since the 
first term SINR^ is different from the remaining ones. Hence, it is necessary to deal with the 
first term and the remaining N — k terms passed as argument of the min(-) function. 

By introducing, w G it holds that 

lhf^W,J2 I WkTk -Wk< 

rk-^< 


i= 


N 
j=k+l 


h^Wj|2 0-2 




( 10 ) 


E,=fc+i |hf Wjf + <Wk, 


where Wk is the fc* component of the vector w, and the expression of SINR^ is used. Likewise, 
for an arbitrary SINR^, k + 1 < j < N, belonging to the remaining terms in the min(-) function, 
we introduce the new variable, v G 


o.5(Af N)^ write the corresponding constraint as the 


^It is not necessary to explicitly constrain the vector r to be positive, since for non-zero data rates this condition holds. 
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following system of inequalities 

N 

TkVj - Vj < |hfwfc|^ ^ \hfwm\‘^ + CT^ < Vj, (11) 

m=/c+l 

where Vj is the j* element of v. Note, that even if the eonstraints in (fSbl) have been transformed, 
the problem remains intraetable. 

From the inequalities in (TTbl) . it holds that 

IhfcWip > . . . |hfcWfc_ip > IhfcWfcp > |hfcWfc+ip . . . > |hfcWArp (12) 


IhfcWjvp < min^e[i,jv_i] |hfcW,^p 




|hfcWfc+ip < min^6[i,fc] |hfcW^p 


= T{k,N) 


(13) 


|hfcW 2 p < IhfcWip. 

Similarly, equivalent transformations, T{1,N) and T{N,N) for k = 1, N can be obtained. We 
eonelude this subseetion by presenting the equivalent formulation of dV]) as 

N \ Jf 


maximize 

ViGC^jVi.rGK^ ,wGK: 


n 


rk 


(14a) 


vGK, 


0 . 5 {N^-N) 


+ ’ \A:=1 


S. t. 


WkTk -Wk< 


Elfc+1 + ct2 < Wk 


k = 1,..., N — 1 


(14b) 


N 


TkVj-Vj < \hfwk\'^, ^ \hfwm\^ + a'^<Vj j = k + l,...,N 

(14c) 


m=k-\-l 


rN -^< 


|h^w 


TV 




r(l, AT), , T{k, N),..., TiN, N) & (173). 


(14d) 

(14e) 


B. Approximation of the non-convex constraints 

Next, we approximate the equivalent formulation in (fT4l) . To this end, note that, excluding the 
power constraint, the first set of constraints in (I14bl) . (ll4cL and the constraints in (I14dl) and (IMel) 
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are all non-convex. The rest of the eonstraints are eonvex, and in faet admit SOC representation. 
Consider the seeond set of eonstraints in (I14bl) i.e., 

/ hf Wfc+i\ 


N 


+ a'^ < Wk ^ 

j=k+l 


hj^WN 

a 

\ 2 / 


< \ 1. (15) 


Similarly, 


/ hfwk+i^ 


N 

\hfwm\^ + a'^<Vj-^ 

m=fc+l 


hfwN 

a 


< + ^ 
- 2 


j =/c + 1,..., iV. 


(16) 


V V / , 

Now, in order to taekle the non-eonvex eonstraints, the so ealled CCP is used. The CCP has 
been widely used in neural eomputing ifTSll . and has reeently found applieations in wireless 
signal proeessing [fT^ . [l20l . The CCP has also been referred to as minorization-maximization 
algorithm (MMA) [[T2]l. ifTTll. 

First, the proeedure of handling the first set of non-eonvex eonstraints in (I14bl) is eonsidered. 
The approximation of the other non-eonvex eonstraints elosely follows the same teehnique. 
Consider the fc'*’ eonstraint 


Wkrk-Wk<\h.^^k?- (IV) 

This is non-eonvex beeause of the bilinear term on the left side and the quadratie term on the 
right side of the inequality. An equivalent transformation of the above inequality is 

WkTk -Wk< (eikf + idk,kf = II Qk,k\\l ^k,k = (hfwfc) , eif, = ^ (hf Wfc) (18) 

where 0^^^ = [Olk^Oh^]^ and f{Qk,k) — Sinee the funetion in the right side of (fTSl) is 

a eonvex one, it follows that lf24]l 

/(0m) = 110^112 > II 0 UII 2 + ^KkViQk,k - ei,k) = 9iQk,k, 0U), (19) 

where the right side of the inequality in (fT9l) is the first order Taylor approximation of the 
funetion ||0fc,fc||2 around 0^ Clearly, this formulation is linear in the variable Qk,k, and will be 
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used instead of the original norm-squared funetion. Three important properties follow here 

f{Qk,k) > g{Qk,k, 0fc,fc), for all 0fc (20a) 

fKk) = 9{el,k,QU), ( 20 b) 


V/(0fc,fc)|e‘_^ = yg{Qk,k, Qk,k)\e\ 


k,k 


(20e) 


where the notation (•)! ^ is used to represent the value of the funetion at 0^ The basie idea 
of the approximation algorithm presented below is to maximize the minorant g{Qk,k, k) 
the variable Qk,k, in order to obtain the next iterate term, 0^^^, i.e., 


OS = max^( 0 fc,fc, 0 fc,fc). 

Using these eonsiderations, it ean be easily eoneluded that 

/(oS) = /(oS) - g{Q\,k. 0^) + g{Q\,k. 0^) 

(a) (6) / N 

> g{Qi,k^ oS) > oU) = /(Om), 


( 21 ) 

( 22 ) 

(23) 


where (a) follows from f{Qk,k) > g{Qk,k, {b) is due to (l2T]) . and the final equality (c) is 
due to (I20bl) . 

Now, to deal with the bilinear produet on the left side of (fTVl) . first we observe that for 
nonnegative Wk,rk it holds that 


WkTk = T [{Wk + Tkf - {Wk - Tkf] . 


(24) 


The quadratie term being subtraeted in the above inequality ean be well approximated by a first 
order Taylor series around Thus, the overall eonstraint in (fTVl) reads as 

0.25(M;fc + rkf - Wk - 0.25 [{wl - rlf + 2{wl - rl){wk - + rl)] < g{Qk,k, Qi,k), 

(25) 

whieh is eonvex in the variables of interest. 

Following similar proeedure the remaining non-eonvex eonstraints in (I14el) . (I14dl) and (IMel) 
ean be approximated as follows. The j-th eonstraint in (IMel) and that in (IMdl) ean be written as 

0.25(rfc + Vjf - Vj - 0.25 [{rl - + 2{rl - vl){rk -rl- Vj + uj}] (26) 


< 5'(0yfc) AT — 1) < g{QN,N, 0^,Ar), 


( 27 ) 
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where 



and represent the points around whieh the quadratic terms have been linearized. 

Finally, the last set of non-convex constraints in (I Mel) can be tackled similarly. To demonstrate 
it, we linearize the first set of constraints in (fT3l) . i.e.. 


|hfcW7v|2< min ¥k,J, 


(28) 


where 



and cjjfc m is th® linearization point. The notation T{k^, N*) is used to represent the approximation 
of the remaining inequalities using this procedure. 


IV. The Proposed Solution 


Having set up the stage as above, in this section the procedure that provides a tractable 
approximation to the sum rate maximization problem is outlined. 

A. The Procedure 

Using the above equivalent transformations and approximations, in the iteration of the 
algorithm outlined in Table HI the following optimization problem is solved 



(29a) 


maximize 




d & dia), k = i,...,N-I 


{Pbt) s.t. 


(29b) 


d & d j = k + l,...,N, d 


(29c) 



represents the collection of all auxiliary variables. For the sake of notational convenience, all 
parameters about which the quadratic terms are linearized in iterate t are defined as 



( 30 ) 




TABLE I 

NOMA/MISO Sum Rate Maximization 
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given randomly generated A° feasible to ([7]). 
t := 0. 

repeat 

1- Solve (|2^ labelled as {Ph). 

2- Set = A*. 

3- Update t ■= t + 1. 

until eonvergenee or required number of iterations. 


The MMA (CCP) algorithm used to solve (l29l) has been summarized in Table HI Note, that 
the eonvergenee eriteria ean vary. For NOMA sum rate maximization, this algorithm terminates 
when the difference between two successive values of sum rate is less than a threshold. This 
aspect is discussed in more detail in section |Vll 


B. Properties of the Proposed Algorithm 

Before describing various characteristics of the algorithm presented above, let us define the 
feasible set, the objective and the set of optimization variables in the iteration, respectively, 
as 


J^t = 
Ot = 


[wj for all z, r, w, V, AI constraints in (Pbt) are satsified] 
r N 1 


max 


^ Tfcl {wj for all 


i, r, w, V, A} e Pt 


.k=l 

Vt = [wj for all r, w, v, A]. 


(31) 

(32) 

(33) 


1) Convergence: 

Proposition 1: The sequence of variables {Vt}t>o is feasible i.e., it belongs to Pq, where Pq 
is the feasibility set of the original problem (fT4l) . 

Proof: Please, refer to Appendix ■ 

Proposition 2: The algorithm in Table U returns a non-decreasing sequence of objective values 
i.e., Ot+i > Ot, and hence it converges. 

Proof: Please, refer to Appendix |Bj ■ 
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In Proposition [2] the property (|2TI) is used. It is important to note that the outeome of this 
proposition remains valid as long as the surrogate is inereasing and does not rely on explieit 
maximization. The inereasing behaviour of all SFs ean be shown by following arguments similar 
to those outlined in [fT^ . IfT^ . As a remark, we point out that when the feasibility set is eonvex 
and eompaet, the algorithm eonverges to a finite value. 

2) KKT Conditions: Under a eouple of technical assumptions the accumulation point of the 
algorithm satisfies the KKT conditions, as summarized in the proposition given below. 

Proposition 3: As the iteration number t tends to infinity, the algorithm in Table U converges 
to the KKT point of (fT4l) . 

Proof: Please, refer to Appendix O ■ 

V. A Reduced Complexity Approximation 

In the original sum rate function given in ([5]), it has been ensured that users with high SNRs 
are able to decode the messages of the weaker ones in the superposition coded signal, and 
hence apply SIC to remove interference from them. Optimal ordering of users depends upon 
physical parameters like, transmit antennas, precoding vectors etc. However, in certain situations 
the channel ordering alone may be sufficient to support the stronger users to decode the weaker 
ones. In scenarios, this is true, only the first term in the min(-) function for a user k needs to 
be retained and the objective becomes 

= E ‘°S2(1 + SINR‘) + log, (1 + j , ( 34 ) 

k=l ^ ' 

From (iMl) it can be seen that A^(A^ —1)/2 SINR terms do not appear in the simplified sum rates. 
In turn, this means that in the formulation of (fT4l) . there are not — N inequality constraints. 


and clearly, a complexity improvement is expected. Before moving on to the complexity analysis 

section, for completeness, the updated optimization problem solved in the run of the algorithm 

in Table H can be written as 

1 



maximize 

/ N \ iv 


(35a) 

{Piff) s.t. 

d & dia), 

fc = l,...,Ar-1, ([271) 

(35b) 




(35c) 
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Fig. 2. Variation of Pr(SINR 2 > SINRi) with the distance of UE-1. T = 6, N = 4 ,7 = 2.0 and distance of UE-2 = 1 is 
fixed. 


where now Ap = { 0 fc,fc, Qn,n, has a redueed cardinality compared to the original set of 

the variable set A. 

In order to provide more insight into the approximation used above let us consider the 
following lemma. 

Lemma 1: Suppose that h^+i = Cfe+ih^, k = 1,. .., iV — 1, so that h„ = c„c„_i... Cfc+ih^, 
where k + 1 < n < N and the magnitudes of the complex constants CnCn-i ■ ■ ■ Ck+i = is 
greater than one. Under this assumption, when Q reduces to (IMl) . then 

||hi||2 < ||h2||2 • • • < ||h7v||2- (36) 


Proof: Please, refer to Appendix |Dj ■ 

From Lemma [H it can be expected that, at least approximately, when that channels are clearly 
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ordered, i.e., the magnitudes of successive channel vectors differ significantly and the channel 
ratio inequalities as given above are satisfied, the problems in (l35l) and (|29|) are equivalent. To 
further highlight, we evaluate below the probability of an event of interest. 

Lemma 2: Consider a random unitary precoding matrix, i.e., W^W = l|^ where W = 
[wi,..., Wat] and W is independent of the channel matrices. For i > j 

Pr (SINRf > SINR^^) (37) 

is given by 

Pr (SINR(= > SINRj) = 1 - ((A* + 2{N - k)) (38) 

- - k)^ ((Ai + Xj)a\2{N - k) + l) , 


where 




(m - 1) • • • (m - 1 - /) 


(m — 1)! 

and Ei(a:) is the exponential integral [l25ll . 

Proof: Please, refer to Appendix |El 
When \j » Xi, (l3^ can be approximated as 

Pr (SINRf > SINR)) ^ 1 - {Xjcr^, 2{N - k)) 

- - k)ilj {Xja^, 2{N - k) + l) . 


(39) 


(40) 


It can be seen from (l40l) that when di decreases (and thus A* decreases), Pr (SINR^ > SINR^^) 
increases as well. Hence, the stronger the channel of UE-z compared to UE-j, the higher the 
probability Pr (SINR^^ > SINRj). In order to further investigate the probability under consid¬ 
eration, in Pig. |2l the variation of Pr(SINR 2 > SINR}) is depicted in terms of the distance 
of UE-1. With a decrease in the channel strength of UE-1, Pr(SINR 2 > SINRJ) increases, 
thereby justifying the use of SINR} instead of SINR^. The probability Pr(SINR 2 > SINR}) 
varies inversely with the noise variance for a given UE-1 distance. It can also be seen that for 
higher values of noise variance, the interference term dominates and the probability that SINR} 
remains below SINR 2 is increased. In addition, this figure also validates the analytical results 
derived above. 


"'Extending this lemma without the orthogonality constraint remains an open problem. 
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Fig. 3. Variation of the per iteration complexity of the exact and approximate NOMA formulations with the number of users 
N for given values of T and c = 10. 


A. Complexity 

In each iteration of the procedure presented in Table H we solve an SOCP. The total number of 
iterations are fixed and only variables are updated in each run of the algorithm. Hence, the worst 
case regarding the complexity is determined by the SOCP in each step. Therefore, to provide a 
complexity estimate, the worst case complexity of the SOCP given by (l29l) or (1^ is estimated. 
It is well known that for general interior-point methods the complexity of the SOCP depends 
upon the number of constraints, variables and the dimension of each SOC constraint ll2^ . The 
total number of constraints in the formulations of (l29l) and (l35l) are 0.5iV^ + 0.5iV^ + 2N + c and 
0.5iV^ — 0.5iV^ + 3iV + c, respectively, where the non-negative integer constant, c, refers to the 
SOC constraints with different N. This happens because of the equivalent SOC representation of 
the geometric mean, given in the objective function, also see ll2^ . Therefore, for both problems 
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the number of iterations needed to reduee the duality gap to a small eonstant is upper bounded by 
C>(\/0.5iV3 + O.SiV^ + 2N + c) and O{\/0.5N^ — 0.5N^ + 3N + c), respeetively ll23l . In order 
to ealeulate the dimension of all SOCs in (l29l) we provide an upper bound because the sums 
of the dimensions for some constraints have been bounded from above by definite integrals 
of increasing functions. This estimate is found to be [1.833iV^ + 3N^ + 8N + NT + 3c — 
5.83333] for (l29l) . The interior-point method’s per iteration worst case complexity estimate of 
dH is C> ((3.5iV2 + 1.5iV + 2NT + c- l)2([1.833iV3 + 3N^ + 8N + NT + 3c - 5.83333])), 
where 3.5iV^ + 1.5iV + 2NT + c — 1 is the number of optimization variables in (l29l) . Likewise, 
the interior-point method’s per iteration complexity to solve the SOCP in (1351) is given by 
C>((2iV2+3iV+2iVT+c-l)2(1.5iV3-iV2 + 10.5iV+iVT+3c-4)),where2iV2+3iV+2iVT+c-l 
and 1.5iV^ — -f 10.5iV -f NT -f 3c — 4 are the optimization variables and the total dimension 
of the SOC constraints in (1351) . 

To provide further insight, we plot the per iteration complexity estimates of the SOCPs in Fig. 
Ill The SOCP in (l29l) is called complete NOMA (C-NOMA), while the one in (1351) is dubbed 
as approximate NOMA (A-NOMA). The figure quantifies the increase in the complexity as a 
function of both N and T. 


VI. Numerical Results 

In this section we investigate the performance of the proposed solution to the NOMA sum rate 
maximization problem. For a given set of antennas T and users N, the channels as hj = \Jdp^i 
are generated, where g* ~ CAA(0,1), and the distances of all users are fixed, such that they are 
equally spaced between distances of 1 and Dq from the BS. It should be noted here that in 
simulations the user distances are fixed and the average is taken over the fast fading component 
of the channel vectors. For each set of results the values of 7 and Dq are mentioned, while it 
is assumed that a = 1 for all users. Similarly, the transmit power is normalized with respect to 
noise, whose variance is taken to be unity. For the simulations the CVX package [|2^ is used. 

In Fig. m we plot the average sum rates versus the transmit power for a three user system 
and a BS equipped with three antennas. We take 7 = 2, Dq = 50, and therefore, the three 
users are placed at 1,25.5 and 50 meters from the BS, respectively. Unless specifically pointed 
out, 7 and Dq retain the same value. It is noted that for transmit power up to 25 dB, the sum 
rates of the complete NOMA (C-NOMA) formulation and the its approximation (A-NOMA) 
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Fig. 4. Achievable sum rates vs. normalized transmit power called TX-SNR. We take T = N = 3, Do — 50 ,7 = 2 and a = 2. 


are equal. This observation is beeause of the distanee effeet, the ordering of the ehannels 
||hi ||2 < ||h 2||2 < ||h 3||2 is valid for all realizations of gj. As a consequenee, SINK} < 
min (SINR 2 , SINR 3 ) , and SINR 2 < SINR 3 . Therefore, the objeetive funetion in ([5]) matehes 
with that in (IMl) . Beeause of the wide range of multiplieative distanee faetor, this observation 
ean be attributed as a result of the Lemma [H Onee the transmit power crosses a certain value (25 
dB in our case), the ordering of users need not to be the optimal one and hence the two curves 
deviate from each other. The A-NOMA approach produces better rates because the interference 
free rates of the last user are boosted more compared to the C-NOMA. This comes with a 
degradation in the sum rates of the N — 1 users (excluding UE-A^) as we will see in the next 
experiment. Interestingly, the competing zero-forcing (ZF) solution performs very poorly for 
lower SNRs, and only produces significant sum rates, when the transmit power is sufficiently 
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Fig. 5. Variation of average sum rates with TX-SNR for different Dq. N = T = 4,a = 1 and 7 = 2.0. 

high. This poor performance of the ZF scheme can be attributed to the distance effect, which 
makes the channel matrix poorly-conditioned IITtII . At higher transmit SNRs this poor condition 
of the channel matrix is partially circumvented and hence a notable increase in ZF rates is 
observed. 

The next set of results presented in Fig. [5] depict the average sum rates of all users excluding 
UE-A^ as a function of transmit power, with N = T = 4. Basically, Fig. [5] can be seen as 
complementing the observations made in Fig. HI where also at high transmit SNR A-NOMA 
has better total sum rates, compared to C-NOMA. It is seen that for low SNRs the curves 
for A-NOMA and C-NOMA overlap. As the the transmit power is further boosted, C-NOMA 
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Fig. 6 . The achievable sum rates as a fucntion of TX-SNR for different values of Do- The parameters taken are N — T = 
4, (7 = 1 and 7 = 2.0. 


outperforms A-NOMA. The reason for the equality of the rates in both teehniques is the same 
as mentioned above. However, at higher transmit SNR the C-NOMA provides better data rates, 
because of lack of optimality in the users’ ordering, the beamformers of A-NOMA will not 
necessarily produce optimum min (SINR;^) for all k. In addition, we have also included curves, 

k<j<N 

when Dq is decreased from 50 to 10 meters. It is evident that because of the shorter distance 
the net effect of distance attenuation, which orders the channels, is diminished. Hence, the gap 
between the graphs of C-NOMA and A-NOMA is enlarged. Nonetheless, overall higher data 
rates are reported in this case because of better channel conditions for the users. 

It can be concluded from the previous discussions, that distance plays an important role in 
determining the aggregate data rates of the NOMA system. Therefore, to further explore its 
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Fig. 7. Iterations required for convergence in C-NOMA approach. We take T = W = 5, cr = 1, Dq = 10 and 7 = 2. 

impact we set N = T = 4 and plot the curves for the sum rates of C-NOMA and ZF, with 
7 = 2. The sum rates of the ZF scheme are shown in Fig. as the distance Dq is deereased from 
50 m to 10 m. As the distanee is deereased, the effeet of path loss is minimized and we have 
better eonditioned channel matriees. Therefore, the sum rates of ZF are eonsiderably enhaneed 
at Dq = 10 m. 

In order to investigate the eonvergenee of the proposed algorithm, we eonsider a downlink 
system with T = 5 antennas, serving N = 5 users. As a stopping eriteria, we use successive 
values of the sum rate returned by the algorithm. The algorithm exits from the main sequential 
iteration loop, when the differenee between two eonseeutive values of the sum rate is less than 
or equal to 10“^. With this eriterion, as shown in Fig. |71 the algorithm converges within 25 
iterations for the three values of transit SNR shown in the figure. Moreover, as expected, with 
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Fig. 8. The effect of TX-SNR on achievable average sum rates when N > T. The parameters used are T — = 1,'y = 

2.0 and Do = 50. 


higher transmit power, we obtain better sum rate. 

As a multiuser system is considered, the proposed approach is expected to deliver acceptable 
spectral efficiency when N > T. The results reported in Fig. [8] show the performance of C- 
NOMA, when the number of users N is greater than the number of transmit antennas T = 3. 
For comparison we have also included the sum rates achieved by the C-NOMA and ZF solutions 
with N = 3 users only. To obtain these two curves, we randomly pick three users to be served 
with C-NOMA and ZF precoders. It is evident that with fewer users C-NOMA underperforms. 
Since, in this case, users are randomly chosen, it is likely that the effective multiuser diversity 
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If28]| is lost and we see a downward trend in aehievable data rates. 

VII. Conclusion 

In this paper, we have studied the sum rate maximization problem of a MISO downlink system 
based on NOMA. Speeifieally, we approximate the originally non-eonvex optimization problem 
with a MM method. For the proposed algorithm, we have solved an SOCP with polynomial 
eomputational eomplexity in eaeh step. For the seenarios eonsidered, the algorithm is numerieally 
shown to eonverge within a few iterations. Furthermore, we developed a redueed eomplexity 
approximation and explore the eonditions under whieh it is tight. Finally, we provide an insight 
into the tightness of the proposed approximation. Our experimental results reveal that the NOMA 
has a superior performanee eompared to eonventional orthogonal multiple aeeess sehemes. High 
data rates are obtained with small transmit power. The distanee attenuation has a very low impaet 
on NOMA performanee. NOMA partieularly outperforms ZF when the number of users is higher 
than the transmit antennas, thus making it an ideal eandidate for enabling multiple aeeess in the 
next generation 5G networks. 


Appendix A 

Proof of Proposition □ 

Without loss of generality, we foeus on the funetion f{Qk,k), its approximation g{dk,k, ^kk) 
and the eonstraint in whieh it appears. The same arguments will be applieable to all non-eonvex 
funetions, their eonvex minorants and the respeetive eonstraints. Therefore, it holds that 

0.25{wk + Tkf -Wk- g{wk, Tk, wl, rl) < g{dk,k, 0^,^), (41) 

where giwk, rk, wl, rl) = 0.25 [{tvl - rl)"^ + 2{wl - rl){wk -wl-rk + rl}] is the approxima¬ 
tion of the original funetion {wk — rk)"^- Note, that this eonstraint is a eonvex approximation 
of that in (fTVl) . Now, let us assume that the tuple {wl,rl, 0^) is feasible to (fTVl) . Clearly, the 
same point also satisfies (l4TI) as a eonsequenee of (I20bl) . Sinee g{wk,rk,wl,rl) < {wk — rk)‘^ 
and /(0fc,fc) > 5'(0fc,fc, 0^), it follows that 

0.25(w;a: + rk)‘^-Wk - 0.25(w;fc - - f{Qk,k) (42) 

< 0.25(w;fc -f rfc)^ - Wk - g{wk, rk, wl, rl) - g{9k,k, Ql,k)- (43) 




26 


Hence, should satisfy (fTTl) because 

0.25(ffii+‘+r‘+')'' - !»'+■ - 0.26(ffi‘+' - r ‘+^)‘^ - /(0«) 


<0.25(ffif‘+r‘+‘)- 




W 


g{Wj^ ,rj^ 5'(0fcjc, 0fcjc) < 0. 


The above conclusion holds for all k and {Vt}t>o, as the algorithm was initialized with A 


( 0 ) 


(44) 

(45) 
G -Fq- 


Appendix B 

Proof of Proposition [2] 

In order to prove this proposition, we note that 3 F). From (|2^ it is clear that the 
surrogate functions used in place of non-convex terms are non-decreasing with iteration number 
i.e., > SF^, where SF is a generic representation of these functions used in the paper and 

is valid for all of them. Therefore, Ft+i 5 Ft, is an immediate consequence, and the statement 
in Proportion [2l follows. Hence, {Ot}t>o is non-decreasing, and possibly converges to positive 
infinity. 


Appendix C 

Proof of Proposition [3] 

The following assumptions are made before outlining the arguments. 

Assumption 1: We assume that as f —)■ oo, the sequence of variables {Vt}t>o generated by 
the algorithm in Table U converges to a value V*. 

Assumption 2: The constraints in the approximate problem (l29l) or (1351) are qualified at the 
accumulation point. 

Without explicitly mentioning the constraints, we use abstract notation to prove the claim made 
in Proposition |3l First let us give a generic representation to all convex constraints in (l29l) as 
Ca(V)^ < 0, a = 1,..., Li, where (V)^ denotes the subset of Vt containing the corresponding 
variables that appear in these constraints. Similarly, let us define as C^(V)p < 0, 6 = Li-l-1,..., L 2 
the constraints obtained by approximating the non-convex functions with convex minorants in 
dH, and (V)p C Vt- Let p*, pI E M+ for all a, b, denote the dual variables at convergence. The 
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KKT conditions of the problem in (|2^ at (V*)^, (V*)^ then read as 

Li L 2 

Vr* + ^I7:VC.(V).+ ^ ij;vcl{vn,^0 (46) 

a=l b=Li+l 

V:Ca{V*)s = 0, a = I,..., L^, vlCiiV*), = 0, b = L, + I,..., L^. (47) 

Sinee all eonvex minorants satisfy the properties in (l20l) . it is easy to conelude that the KKT 
eonditions given above will reduee to those of the problem in (|29|) . Similar eonelusion also holds 
for the simplified problem in (1351) . 


For 


Appendix D 
Proof of Lemma [H 

to be valid for all 1 < /c < — 1, it holds that 

SINR^ < min SINK?. 


iG[fc+l,Af] 

For an arbitrary /c G [1, iV — 1] and i = n, let us eonsider the following inequality, 


Ih^Wfcl 


IF-H'w |2 n-2 

Z^m=fc+1 '^rn\ T '-'n Z^m=fe+1 ’''"il 
where we have assumed that the noise varianees at the n* and the fc* nodes are (T^ and a1, 

respeetively. By substituting the assumptions made in the lemma, h„ = c„c„_i... Cfc+ih^ = 

c^^h.k, where k + 1 < n < N. After some simple manipulations 


> 


l^k^k 


|h^w^|2 + a: 


2’ 

k 


(48) 


(49) 


N 


N 


+ + aE|c; 

7n=k-\-l m=k-\-l 

Now, if an = CTk, using the eondition on we obtain ||h „||2 

the same argument for all k, the required proof follows. 


n \ Tn 


> - . 

O'k 


(50) 


> ||hfc ||2 for all n. Repeating 


Appendix E 
Proof of Lemma [2] 


The SINR)', i > k, ean be written as 


SINRf = 


|hfw,|2 


/L^m=. 


m=fc+l 


|hfwmP + 0-2 


(51) 


If a random unitary matrix is used for preeoding, |hf^Wfcp is still eomplex Gaussian distributed, 
sinee a unitary transformation of Gaussian veetors is still eomplex Gaussian distributed. In 
addition, |hf^Wfcp and |hfw;p, k ^ I are independent. Define Xik = Ihf^w^p and yik = 
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Xlm=fc+i Therefore Xik is an exponentially distributed random variable, with Aj = d'l. 


i.e., fxiki^) = AjC Similarly, i/ik follows the Chi-square distribution, i.e.. 


fyjy) = 7 ^ 


_XjV-fc^(7V-fc-l) 


->^iy 


{N-k-l)\ 


(52) 


Consequently the eumulative distribution funetion of SINRf can be calculated from the following 


Pr (SINR,^ < 0) = Pr 


^ik 


< e 


Uik + cr2 
1 - f,Jv)dy 

„— Xi6a ^ roo 


= 1 - 




{N-k-l)\ 

Applying [l25l Eq. (3.351.3)], the pdf of SINR^^ can be obtained as follows: 

p-XiCr'^z 

-^SINRf (^) = 1 “ 


(53) 

(54) 

(55) 

(56) 


(1 + z)^-^' 

Again, following the unitary transformation of Gaussian variables, the desired probability can 
be evaluated as 


Pr (SINRf > SINR^^) = 




1 - 


(1+^) 


N-k 


/sinr((^)'^^ 


= 1 




/o (l + z)2(^-'=) (1 + .s)2(^-^)+i 

Applying [|25l Eq. (3.351.4)], the above probability can be expressed as 

Pr (SINR(= > SINRJ) = 1 - ((A* + Xj)(T\ 2{N - k)) 

_ e(A.+A,P= (jv _ + x,)a\ 2{N - k) + l) , 

and the proof is completed. 


dz. 


(57) 

(58) 

(59) 
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